Intelligent Information Filtering via Hybrid Techniques : Hill Climbing , Case - Based Reasoning , Index Patterns , and Genetic Algorithms

نویسندگان

  • Kenrick Jefferson Mock
  • Sergio Alvarado
  • Thuyen Nguyen
  • Jennifer Dunham
چکیده

As the size of the Internet increases, the amount of data available to users has dramatically risen, resulting in an information overload for users. This work shows that information overload is a problem, and that data is organized poorly by existing browsers. To address these problems, an intelligent information news filtering system named INFOS (Intelligent News Filtering Organizational System) was created to reduce the user’s search burden by automatically eliminating Usenet news articles predicted to be irrelevant. These predictions are learned automatically by adapting an internal user model that is based upon features taken from articles and collaborative features derived from other users. The features are manipulated through keyword-based techniques, knowledge-based techniques, and genetic algorithms to build a user model to perform the actual filtering. The integration of knowledge-based techniques for in-depth analysis, statistical and keyword approaches for scalability, and genetic algorithms for exploration allows INFOS to achieve better filtering performance than by using either technique alone. Experimental results collected from the prototype of INFOS validate the gain in performance within the domain of news articles posted to electronic bulletin boards.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Information filtering via hill climbing, wordnet, and index patterns

The recent explosion in Internet growth has left many users awash in a sea of information, and has spurred the need for intelligent filtering systems. This paper describes work implemented in the INFOS (Intelligent News Filtering Organizational System) filtering system that is designed to reduce the user's search burden by automatically categorizing data as relevant or irrelevant based upon use...

متن کامل

Hybrid Hill-Climbing and Knowledge-Based Techniques for Intelligent News Filtering

As the size of the Internet increases, the amount of data available to users has dramatically risen, resulting in an information overload for users. This work involved the creation of an intelligent information news filtering system named INFOS (Intelligent News Filtering Organizational System) to reduce the user’s search burden by automatically eliminating Usenet news articles predicted to be ...

متن کامل

Adaptive User Models for Intelligent Information Filtering

As networked systems grow in size, the amount of data available to users has increased dramatically. The result is an information overload for the user. In this project, an intelligent information filtering system reduced the user's search burden by automatically eliminating incoming data predicted to be irrelevant. These predictions are learned by adapting an internal user model which is based...

متن کامل

A Search Based Approach for Overlapping Concept Boundaries

This paper presents techniques to integrate boundary overlap into concept assignment using Plausible Reasoning. Heuristic search techniques such as Hill climbing and Genetic Algorithms are investigated. A new fitness measure appropriate for overlapping concept assignment is introduced. The new algorithms are compared to randomly generated results and the Genetic Algorithm is shown to be the bes...

متن کامل

Comparison of Genetic and Hill Climbing Algorithms to Improve an Artificial Neural Networks Model for Water Consumption Prediction

No unique method has been so far specified for determining the number of neurons in hidden layers of Multi-Layer Perceptron (MLP) neural networks used for prediction. The present research is intended to optimize the number of neurons using two meta-heuristic procedures namely genetic and hill climbing algorithms. The data used in the present research for prediction are consumption data of water...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996